Harvesting Indices to Grow a Controlled Vocabulary: Towards Improved Access to Historical Legal Texts

Authors

  • Michael Piotrowski
  • Cathrin Senn
Abstract

We describe ongoing work aiming at deriving a multilingual controlled vocabulary (German, French, Italian) from the combined subject indices from 22 volumes of a large-scale critical edition of historical documents. The controlled vocabulary is intended to support editors in assigning descriptors to new documents and to support users in retrieving documents of interest regardless of the spelling or language variety used in the documents.

Similar resources

Porting Elements of the Austrian Baroque Corpus onto the Linguistic Linked Open Data Format

We describe work on porting linguistic and semantic annotation applied to the Austrian Baroque Corpus (ABaC:us) to a format supporting its publication in the Linked Open Data Framework. This work includes several aspects, such as a derived lexicon of old forms used in the texts and their mapping to modern German lemmas, the description of morphosyntactic features and the building of domain-specific...

The effects of captioning texts and caption ordering on L2 listening comprehension and vocabulary learning

This study investigated the effects of captioned texts on second/foreign (L2) listening comprehension and vocabulary gains using a computer multimedia program. Additionally, it explored the caption ordering effect (i.e. captions displayed during the first or second listening), and the interaction of captioning order with the L2 proficiency level of language learners in listening comprehension a...

Reduplication in Persian Language and Literature

Reduplications are formed by repeating part of the base. The repeated part has no meaning of its own, is never used alone, and is common mainly in spoken language. In recent times, reduplications have been used in some texts of poetry and prose, in particular in stories written in the vernacular. This research, with a historical approach and an analytical-explanatory method, examines the information...

The Effect of “Narrow Reading” on Learning Mid-Frequency Vocabulary: The Role of Genre and Author

This study investigated the effect of Narrow Reading (NR) on learning mid-frequency words. The Vocabulary Size Test (VST) designed by Nation and Beglar (2007) was administered as the first pre-test to 196 students, from among whom 91 students whose vocabulary size ranged between 2100 and 3500 word families became the target of this study and were randomly c...

Quantifying Text Difficulty with Automated Indices of Cohesion and Semantics

We evaluated the effectiveness of new indices of text comprehension in measuring relative text difficulty. Specifically, we examined the efficacy of automated indices produced by the web-based computational tool Coh-Metrix. In an analysis of 60 instructional science texts, we divided texts into groups that were considered to be more or less difficult to comprehend. The defining criteria were ba...

Publication date: 2012